首页> 外文OA文献 >Effect fusion using model-based clustering
【2h】

Effect fusion using model-based clustering

机译:使用基于模型的聚类进行效果融合

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In social and economic studies many of the collected variables are measuredon a nominal scale, often with a large number of categories. The definition ofcategories is usually not unambiguous and different classification schemesusing either a finer or a coarser grid are possible. Categorisation has animpact when such a variable is included as covariate in a regression model: atoo fine grid will result in imprecise estimates of the corresponding effects,whereas with a too coarse grid important effects will be missed, resulting inbiased effect estimates and poor predictive performance. To achieve automatic grouping of levels with essentially the same effect, weadopt a Bayesian approach and specify the prior on the level effects as alocation mixture of spiky normal components. Fusion of level effects is inducedby a prior on the mixture weights which encourages empty components.Model-based clustering of the effects during MCMC sampling allows tosimultaneously detect categories which have essentially the same effect sizeand identify variables with no effect at all. The properties of this approachare investigated in simulation studies. Finally, the method is applied toanalyse effects of high-dimensional categorical predictors on income inAustria.
机译:在社会和经济研究中,许多收集到的变量是按名义规模测量的,通常具有很多类别。通常,类别的定义不是明确的,并且可以使用更细或更粗的网格进行不同的分类。当将这样的变量作为协变量包含在回归模型中时,分类会产生影响:精细的网格将导致对相应效果的不精确估计,而网格太粗则将错过重要的影响,从而导致效果估计偏向且预测性能差。为了实现基本上具有相同效果的级别的自动分组,我们采用贝叶斯方法并将级别效果上的先验指定为尖峰正常分量的分配混合。混合权重先验地导致了水平效应的融合,这鼓励了空组分。MCMC采样过程中基于模型的效应聚类允许同时检测具有基本相同效应大小的类别,并识别完全没有效应的变量。在仿真研究中研究了这种方法的性质。最后,该方法用于分析高维分类预测变量对奥地利收入的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号